Streaming Binary Sketching based on Subspace Tracking and Diagonal Uniformization
نویسندگان
چکیده
In this paper, we address the problem of learning compact similarity-preserving embeddings for massive high-dimensional streams of data in order to perform efficient similarity search. We present a new method for computing binary compressed representations -sketchesof high-dimensional real feature vectors. Given an expected code length c and high-dimensional input data points, our algorithm provides a binary code of c bits aiming at preserving the distance between the points from the original high-dimensional space. Our offline version of the algorithm outperforms the offline state-of-the-art methods regarding their computation time complexity and have a similar quality of the sketches. It also provides convergence guarantees. Moreover, our algorithm can be straightforwardly used in the streaming context by not requiring neither the storage of the whole dataset nor a chunk. We demonstrate the quality of our binary sketches through extensive experiments on real data for the nearest neighbors search task in the offline and online settings.
منابع مشابه
Model Checking Markov Chains Using Krylov Subspace Methods: An Experience Report
The predominant technique for computing the transient distribution of a Continuous Time Markov Chain (CTMC) exploits uniformization, which is known to be stable and efficient for non-stiff to mildly-stiff CTMCs. On stiff CTMCs however, uniformization suffers from severe performance degradation. In this paper, we report on our observations and analysis of an alternative technique using Krylov su...
متن کاملBinary Coding in Stream
Big data is becoming ever more ubiquitous, ranging over massive video repositories, document corpuses, image sets and Internet routing history. Proximity search and clustering are two algorithmic primitives fundamental to data analysis, but suffer from the “curse of dimensionality” on these gigantic datasets. A popular attack for this problem is to convert object representations into short bina...
متن کاملNeural Network Performance Analysis for Real Time Hand Gesture Tracking Based on Hu Moment and Hybrid Features
This paper presents a comparison study between the multilayer perceptron (MLP) and radial basis function (RBF) neural networks with supervised learning and back propagation algorithm to track hand gestures. Both networks have two output classes which are hand and face. Skin is detected by a regional based algorithm in the image, and then networks are applied on video sequences frame by frame in...
متن کاملTransmission subspace tracking for MIMO communications systems - Global Telecommunications Conference, 2001. GLOBECOM '01. IEEE
This paper describes the benefits of transmission subspace tracking for multiple input multiple output communications systems, and applies the concepts of previous work on adaptive transmit antenna algorithms to this problem. A specific stochastic gradient technique of subspace or “multi-mode” tracking of the independent modes of the MIMO transfer function and the application to space time codi...
متن کاملTransmission subspace tracking for MIMO communications systems
This paper describes the benefits of transmission subspace tracking for multiple input multiple output communications systems, and applies the concepts of previous work on adaptive transmit antenna algorithms to this problem. A specific stochastic gradient technique of subspace or "multi-mode" tracking of the independent modes of the MIMO transfer function and the application to space time codi...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- CoRR
دوره abs/1705.07661 شماره
صفحات -
تاریخ انتشار 2017